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Summary 

Two novel source coding algorithms are presented which are suitable for the recording of digi- 
tal High Definition Television signals. One of the bit-rate reduction schemes is based on the 
Wavelet Transform and the other on the Lapped Orthogonal Transform (LOT). 

According to the first scheme, a multi-layered, hierarchical description of the source signal is 
obtained by means of a quad-tree, half-band wavelet transform. This transform decomposes the 
input signal into a collection of spectrally non-overlapping sub-bands. Individual sub-bands are 
quantised and entropy coded using a novel predictive arithmetic coding technique. 

The LOT scheme is a block-based technique offering substantial immunity against blocking 
artefacts which are common with the Discrete Cosine Transform (DCT) and other block-based 
coding schemes. The transform coefficients are quantised, scanned according to a predetermined 
spectral order and variable-length coded. 

In this Report, special emphasis is placed on issues related to quality degradation after multiple 
generations of coding and decoding operations. Post-processing between coding generations, 
by means of spatial displacement in both the horizontal and vertical dimensions, is applied as a 
severe test of performance. Further emphasis is placed on the cascadability of different bit-rate 
reduction schemes. This reflects the possibility of having to combine equipment using different 
coding algorithms in the studio. 

Results obtained from simulating the coding algorithms, show noticeable improvement over the 
current state-of-the-art algorithms for still picture encoding based on the DCT, both in terms of 
subjective quality and of measured mean-square error. However, when the proposed algorithms 
are cascaded with DCT-based bit-rate reduction schemes for a limited number of coding- 
decoding operations, the benefit is quickly lost, since the overall quality is determined by the 
quality achieved by the DCT-based system. 
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1. INTRODUCTION 

Digital recording of video signals enables high quality 
reproduction which does not degrade with copying if 
the signals remain in digital form. 

Digital High-Definition Television (HDTV) signals 
contain 4-5.5 times more samples than standard 
component digital television signals whose parameters 
are defined in ITU-R (formerly CCIR) Recommenda- 
tion 601 . Current tape and recording mechanism 
technology is incapable of providing acceptable 
engineering solutions that satisfy the conflicting 
requirements for long playing times, small scale 
mechanics and portability at the raw data rates 
associated with HDTV resolution signals. 

Previous work at R&D has demonstrated the 
feasibility of combining four Dl digital recorders for 
the purpose of recording HDTV signals sampled at 
54 MHz. The scheme, known as Quadrega,'' was 
successfully implemented in 1988. It was based on 
distributing picture data among the four recorders, 
allowing recognizable recovery from any one of four 
tapes. For replay, the outputs from the recorders have 
been de-interleaved to reproduce the original signal. 
Despite the fact that Quadrega has proved the only 
reliable digital VTR for HDTV post-production, it has 
the obvious disadvantages of being expensive, bulky 
and cumbersome to use. 

As an alternative, video compression offers the 
possibility of HDTV recording on conventional- 
definition digital recorders and other storage devices. 
For the compression effort to be worthwhile, the 
advantages of uncompressed digital recording should 
be maintained. Additionally, the established and 
expected operational features of professional video 
recorders should be achieved at the highest quality and 
with the minimum of processing. In particular, random 
access to individual frames of a moving sequence, for 
the purpose of editing, excludes the possibility of 
considering interframe methods exploiting temporal 
redundancy. Moreover, a multi-layered code structure 
is desirable to facilitate modes of recorder operation, 
such as picture-in-shutde and allow recognizable 
playback of recorded material at tape speeds other than 
normal. For professional studio applications, a trans- 
parent quality of decoded signals is essential, even 
after multiple generations of recording. For consumer 
applications, near-transparency is desirable, as well as 
requirements for simple integration into high-volume, 
low-cost equipment for off-air recording, making 



home videos and for other domestic uses, 

Current state-of-the-art algorithms for image compression 
are based on the Discrete Cosine Transform (DCT). 
This is already an integral part of several 
internationally standardised video coding algorithms 
and has been shown to provide good quality pictures at 
low bit rates. 

Recently, video compression equipment suitable for 
digital HDTV recording has been purchased at R&D. 
The equipment, manufactured by BTS, provides the 
functions of digital encoding/decoding as a front/back 
end to a Dl recorder. The compression algorithm used, 
is based on earlier work carried out in the framework 
of the European collaborative project EU95. An 
intraframe coding technique is employed, based on 
entropy coding of quantised DCT coefficients. 
Additional provisions are made to achieve near- 
constant bit-rate by means of pre- and post-coding 
arrangements. A comprehensive assessment of the 
simulated coding performance of this algorithm has 
been performed at R&D prior to equipment purchase. 

Recent advances in image analysis,^"^ and data 
compression*"* have enabled the formulation of 
powerful source coding techniques. Their application 
to digital image compression has been shown to 
overcome some of the barriers imposed by block-based 
transform coding techniques. Additionally, they offer 
significant advantages in terms of quality, natural 
compatibility with video formats requiring lower- 
resolution and graceful performance degradation in 
bandwidth-restrictive environments. 

This Report is an account of research to identify new 
source coding techniques suitable for recording 
purposes. The main component of this study has been 
the developmerit of bit-rate reduction schemes based 
on either the Wavelet Transform or the Lapped 
Orthogonal Transform. Special emphasis is placed on 
issues related to quality degradation after multiple 
generatioris of coding and decoding operarions. 
Post-processing between coding generations, by means 
of spatial displacement in both the horizontal and 
vertical dimensions, is applied as a severe test of 
performance. Further emphasis is placed on issues 
related to the cascadability of different bit-rate 
reduction schemes such as the ones developed and the 
Discrete Cosine Transform. The latter is relevant in 
view of the standardisation effort for still and moving 
picture coding, which is certain to impose a 
DCT-based coding standard for digital television 
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broadcasting. Cascadability is also a measure of 
quality degradation resulting from non-homogeneous 
coding chains in a studio environment. 



2. WAVELET-BASED CODING 

2.1 System overview 

A simplified block-diagram of the proposed system is 
shown in Fig. 1. A quad-tree, half-band wavelet 
transform is used to decompose the input signal into a 
collection of spectrally non-overlapping sub-bands. 
These provide a multi-layered, hierarchical description 
of the source signal. The most significant layer is a 
low-pass filtered and subsampled version of the input. 
In recording applications, this can be used to provide 
compatibility with lower definition video formats, as 
well as facilitate picture review when the tape is 
shuttling. On the other hand, the least significant layers 
convey higher frequency information, but their faithful 
reconstruction is not as critical to the quality of a 
decoded picture. The above description is used to 
influence the allocation of the available coding 
resources to the various layers, with priorities 
determined by visual significance of the information in 
the reconstructed picture. 

Following wavelet tran.sformation individual sub- 
bands are quantised and entropy-coded by using a 
novel predictive arithmetic coding technique. 
Quantisation is the principal means by which 
distortion is induced in the reconstructed picture. The 
quantisation rule used is adapted for each layer of 
information, so that less significant layers are coarsely 
quantised and vice-versa. The lowest frequency 
sub-band is not entropy coded but is represented in 
fixed-length PCM format. This assists in providing 
more rugged reconstruction of the low frequency 
components of pictures. It also contributes to more 
robust code recovery in the event of partial channel 
failure, although this issue is not addressed in this 
work. After quantisation, higher frequency sub-bands 
contain predominantly zero-valued elements which 
exhibit a high degree of spatial correlation. This is 
exploited by using a novel predictive variable-length 
entropy coding scheme based on the arithmetic coding 
principle - explained in more detail in Section 2.3.2. 



wavelet 
transform 



wavelet 
coder 



bit-rote 
control 



An alternative strategy based on block coding has also 
been investigated. 

Decoding consists of variable-length decoding, inverse 
quantisation, interpolative filtering of individual sub- 
bands using the appropriate wavelet kernels, and 
sub-band summation to obtain a reconstruction of the 
source picture. 

Accurate bit-rate control is achieved by varying the 
quantisation parameters used. Fixed step sizes are used 
for the low frequency sub-bands. Then, by taking into 
account the bit-rate reduction achieved for those, it is 
possible to fine tune the quantisers of the high 
frequency sub-bands to obtain the required bit-rate 
control. This is discussed in Section 4. 

2.2 Wavelet transforms 

A wavelet transform is a signal decomposition 
technique which uses a family of real orthonormal 
bases \(f„ „ (x) obtained through translation and dilation 
of a kernel function vj/ (jc) known as the mother 

wavelet : 



V;n,«W 



= 1 



m/i 



\^{2 '"x-n) 



(1) 



where m, n are integers parametrising dilation and 
translation respectively. The wavelet coefficients of a 
signal /f.rj are given by: 



,„W = J /WH'm.nW^ 



(2) 



One way of constructing mother wavelets is by 
determining a function ^(x) which satisfies a two-scale 
difference equation: 



(t.(4 = V2 X h{k) <^(2x-k) 



(3) 



Fig. } - Simplified block diagram of wavelet coder. 



^(x) is commonly referred to in the literature as the 
scaling function. The wavelet kernel is related to the 
scaling function via: 



V/W = V2" ^ g(i) <^{2x-k) 

k (4) 
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where 



^W = jf XH'Cf)*(^-^) 



and 



(5) 



Table J 

Wavelet filter coefficients. 



k 





±1 


±2 


±3 


±4 


^.U) 
^M.) 


0.602949 
0.557543 


0.266864 
0.295636 


-0.078223 
-0.028772 


-0.016664 
-0.045636 


0.026749 
0.0 



h(k) 



= 1^ I<^(f)*c 



vr 



x-k) 



(6) 



Iterated signal decompositions employ suitably dilated 
mother wavelets and scaling functions to obtaiti a 
multi-resoJution representation of the source signal. 
For discrete signals, this can be accomplished via the 
standard filterbank configurations used in sub-band 
decompositions. In this case, the synthesis and analysis 
filters are versions of h (k) and g (k) and correspond to 
high- and low-pass sub-band filters. A variety of 
methodologies has been proposed in the literature for 
the purpose of determining suitable such filters. For 
image coding purposes, it has been shown that 
relaxation of the orthonormaiity requirement yields 
particularly useful filter configurations. A very 
attractive feature of such an alternative is the potential 
for linear phase. A prime exponent of non-orthonormal 
wavelet bases is based upon the enforcement of 
bi- orthogonality conditions.^ Derivative filters have to 
satisfy the following relationships in order to 
guarantee exact reconstruction: 



g{k) = {-\)' h (1-fc) 



g{k) = (-1)* hO-k) 



(7) 



(8) 



where h and g are respectively the low- and high-pass 
reconstruction filters. 



For two-dimensional signals, of particular interest are 
separable filter kernels of compact support. Separabil- 
ity allows the implementation of two-dimensional 
filtering as a cascade of two one-dimensional filtering 
operations; at the same time, compact support ensures 
that only a small number of non-zero coefficients are 
necessary for the implementation of the wavelet 
transform. A typical set of wavelet filter analysis and 
synthesis coefficients h (k) and h (k), taken from 
Reference 5, is shown in Table 1 below: 



A simpHfied view of a wavelet transform implemented 
by two and three iterations of filtering on low 
frequency sub-bands is shown in Fig. 2. A total of 
seven and ten output components (sub-bands) are 
respectively obtained with sub-band 1 containing the 
lowest order spectral information in each direction, 
vertically and horizontally. A simplified view of a 
wavelet filter implemented by a separable, two- 
dimensional, compact support, filter-and-subsample 
structure is shown in Fig. 3 (overleaf). If h (k) and 
g (k) are half-band filters the resulting partition of the 
source signal spectrum after two and three iterations of 
filtering is shown in Fig. 4 (overleaf). 
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Fig. 2 - Wavelet transform by iterated filtering. 

2.3 Coding of wavelet coefficients 

The previous section has described how wavelet 
transform coefficients can be obtained as elements of 
sub-bands that emerge from the iterative application of 
wavelet filtering. For the purpose of encoding these 
coefficients, two processes have been considered in 
cascade: quantisation and entropy coding. Two different 
entropy coding schemes have been investigated; intra- 
band predictive coding and interband block coding, 
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Fig. 3 - Wavelet filtering as a separable 
fdter-and-subsample routine. 

The first scheme employs an arithmetic coding tech- 
nique, while the second operates a Huffman coding 
procedure which uses pre-specified symbol-likelihood 
information. 



2.3.1 Quantisation 

Coefficient quantisation is the principal distortion- 
inducing mechanism in most state-of-the-art image 
coding algorithms. The quantisation strategy adopted, 
has been determined by experiments to assess 
distortion at different quantisation rates. Uniform 
quantisation has been applied individually to each 
coefficient (sub-band element). The same quantiser 
step size was used for elements belonging to the same 
sub-band, while different step sizes were considered 
for different sub-bands. Step sizes were adapted to the 
frequency content of the encoded sub-band. The 
purpose of adaptation has been the control of the 
perceptual impact of quantisation errors by taking into 
account characteristics of the human visual system. As 
a consequence, greater error tolerance has been 
allowed when quantising the elements of higher- 
frequency sub-bands. Non-uniform quantisation 
strategies, approximately matching the statistical 
distributions of sub-band elements, have also been 
investigated. Such strategies have involved Gaussian 
and Laplacian Max-Lloyd quantisers.^ Our 
experiments have shown that their performance is 
inferior when cascaded with the entropy coding 
schemes investigated. 



Uniform quantisation of an unquantised wavelet 
coefficient w^^ ^ of raster scan order n within sub-band 
m, is implemented as a division of that coefficient by 
an appropriate weight Q{m) as shown below: 



■'m, n 



vv 
m,n 

Qitn) 



(9) 



where q^ „ is the quantiser output and the [■] operator 
denotes truncation/rounding to the nearest integer. 
Quantiser weights used for 4:1 compression of test 
pictures are shown in Tables 2 and 3 for the 7- and 
10-sub-band schemes respectively, Quantiser weights 
used for 8:1 compression are shown in Table 4 for the 
7-sub-band scheme. 

Table 2 

Quantiser weights for the 4:1 compression ratio, 

7-sub-band scheme. 



sub-band m 


1 


2 


3 


4 
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7 


Qim) 


1 


2 


2 


2 


4 


4 


4 



Table 3 

Quantiser weights for the 4:1 compression ratio, 

lO-sub-band scheme. 
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Fig. 4 - Spectrum partition with quad-tree wavelet transform. 
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Table 4 

Quantiser weights used for 8: J compression ratio, 

7 -sub-band scheme. 



segment #m 



sub-band m 
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2 
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Qim) 


4 


16 


16 


16 


32 


32 


32 



2.3.2 Intraband coding 

Elements of the lowest frequency sub-band (sub-band 
#1) are not bit-rate reduced any further but are 
represented in fixed-length PCM format. Typically, the 
precision of the PCM representation varies from 8 to 
10 bits for luminance signals and from 7 to 9 bits for 
chrominance signals. The exact number of bits used 
depends upon the number of sub-bands, the gain of the 
wavelet filter and the quantiser weights used. 

Higher frequency sub-bands (sub-band #2 and higher) 
are individually compressed by using a predictive 
coding technique. The particular prediction mechanism 
used, compiles (on-the-fly) likelihood tables from 
sub-band elements previously encountered along the 
direction of coding. This requires a significant amount 
of storage which grows exponentially with the 
amplitude range of predicted elements. In order to 
limit such storage requirements, it is necessary to 
constrain the amplitude excursions of the quantised 
coefficients within a pre-specified range. This is 
achieved, as a pre-processing step to predictive coding, 
by slicing sub-bands into multiple segments; each 
segment is of limited precision and is defined by a 
maximum and a minimum level. Segment elements are 
not allowed to exceed or fall short of these levels. 
Finally, segments are offset so as to have a zero 
minimum level. If q^^ is a quantised wavelet 
coefficient of raster scan order n within sub-band m 
obtained by (9), then the n-th element of the k-i\\ 
segment of the m-i\\ sub-band 5^^ „ is given by the 
following: 



\m, rt ~ '' 



^„,„ < {k-\){S-\) 

^„,„- (^-1){5-1) {k-D S < q^„ < kS-\ 

(10) 



where 



k = 



(11) 



and { , 1 , ... , S-1 } is the amplitude range of a 
segment. This is illustrated in Fig. 5 for 5 = 64. 



quantised sub-bond 




Fig. 5 - Sub-band slicing as a preliminary stage to 
predictive coding. 

Predictive coding is subsequently applied on 
individual segments. This has the added benefit of 
confining possible decoding errors within sub-band 
segments only; which, in turn, allows partial sub-band 
recovery from the remaining constituent segments. An 
overview of the coding scheme is presented in Fig. 6. 
Sub-band slicing is a lossless mapping and minimally 
affects the bit-rate reduction performance. Lossless 
sub-band recovery is achieved simply by summing all 
constituent segments. In our experiments, 6-bit 
segments (5 = 2*) have been used. For a two-element 
predictor, the resulting storage requirement is 512 k 
bytes as will be discussed in the following section. 
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Fig. 6 - Simplified block diagram of intraband 
predictive coder. 

2.3.2.1 Variable-length predictive 
coding 

Higher-frequency sub-band segments are entropy 
coded individually by using a predictive arithmetic 
coding technique^ adapted for multi-symbol source 
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alphabets^. This technique has been previously 
considered for edge map compression apphcations in 
Reference 10. In the current application, a whole 
sub-band segment (a sequence of quantised and 
bit-sliced wavelet coefficients) is examined and a 
single, variable-length, binary codeword is produced. 
Segment elements are examined sequentially, one at a 
time, according to a predetermined order (i.e. raster 
scan). Nevertheless, the output codeword is not 
available before the last element in the input sequence 
has been examined. The length of the output codeword 
is determined by the conditional likelihoods of input 
elements. Conditioning is based upon the examination 
of a prediction template consisting of a few neigh- 
bouring elements (in this application a template of two 
elements). 

If s(i) is the i-th element of an A^ x A'^ segment, then the 
corresponding 2-element prediction template T(i) is 
the set: 



7(0 = [sii-l),s{i-M)\ 



(12) 



Conditional probabilities are computed on-the-fly and 
are updated upon examination of a new input element. 
Consequently, a significant advantage of on-the-fly 
computation is its capability to adapt to input statistics. 

A convenient abstraction, frequently quoted in the 
arithmetic coding literature,* calls for viewing the 
output binary codeword as a point located inside the 
semi-open interval [0,1). Hence, this interval is 
assumed to contain all arbitrary -length binary 
fractions. When input symbol s is encountered, its 
corresponding template T is determined. The ratio 
c {T, s) I c (T) is computed for all s used to partition 
the semi-open interval to proportional non-overlapping 
sub-intervals. The sub-interval corresponding to the 
current input symbol is retained while the remainder of 
the partitioned interval is discarded. The next input 
symbol and its template are subsequently used to form 
a new likelihood estimate (15) and a new partition on 
the retained sub-interval. In between successive 
partitions, statistics c (T, s) and c{T) are updated. A 
simplified view of the arithmetic coding sequence of 
events for a sample input sequence is shown in Fig. 7. 



Computation of conditional likelihoods involves the 
registration of cumulative co-occurrences of template 
elements and input elements. Let c{T) be the number 
of times prediction template T has been encountered 
and c{T, s) be the number of times T and s have 
co-occurred during examination of an input symbol 
string, so that: 



S-\ 
c{T) =X^(^'^) 



(13) 



where {0,1,..., 5-1} is the input symbol 
alphabet (in this application this is identical to the 
amplitude range of a segment). The minimum output 
code string length L is given by the following 
relationship: 



|5-1,5-1) (5-1, S- 1 1 5-1 

L = X f(r ) log c (T) - ^ ^ c {T, s) log c (T, s) 

T='0,0] T=\0.0\ i=0 

.1 If (14) 

The above is equivalent to the entropy of the random 
variable defined by the set of all strings whose 
conditional probabilities are chosen as follows: 



Pis\T) = 



c{T) 



(15) 



2.3.3 Interband coding 

As in the previous scheme, quantised elements of the 
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Fig. 7 - Predictive arithmetic coding using 
neighbourhood template. 
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Fig. 8 - Block formation by suitable combinations of 
sub- band elements. 

lowest frequency sub-band (sub-band #1) are not 
bit-rate reduced any further but are represented in a 
fixed-length PCM format. Nevertheless, for notational 
convenience, they will be taken into account in the 
block formation and block scanning procedures 
described below. Higher frequency sub-bands (sub- 
band #2 and higher) are individually compressed by 
using a block coding technique. Coding blocks are 
suitable arrangements of quantised sub-band elements 
which are subsequently scanned so as to form a one- 
dimensional sequence. This sequence is run-length 
coded using pre-defined Variable Length Coding 
(VLC) tables. 

Square coding blocks of 8 x 8 = 64 elements are 
formed from 10-band sub-band schemes. This 
particular choice of block size coincides with that used 



in block-based transform coders (i.e. DCT). Blocks 
consist of quantised sub-band elements taken from the 
available sub-bands as follows: sub-bands 1-4 contribute 
single elements; sub-bands 5-7 contribute square 
blocks of 4 elements; sub-bands 8-10 contribute square 
blocks of 16 elements, according to the arrangement 
shown in Fig. 8. It is evident that such a block 
formation procedure easily generalises for any quad- 
tree decomposition scheme. For instance, square 
coding blocks of 4 x 4 = 16 elements can be formed 
in an analogous fashion from 7-band sub-band 
schemes. 

The 8x8 square blocks are scanned according to 
either of the patterns shown in Fig. 9. The element 
contributed from the lowest frequency sub-band is 
shaded. The ordering obtained facilitates entropy 
coding by placing low-frequency sub-band elements, 
which are more likely to be non-zero after quantisation, 
before high-frequency sub-band elem.ents. Moreover, 
elements belonging to the same sub-band are scanned 
so that runs of consecutive zero elements are likely to 
be maximised. Due to the block formation procedure 
described above, this scanning pattern is notably 
different from the conventional 'zig-zag' pattern used 
in DCT-based coding schemes. 

Block elements are run-length coded by taking into 
account the above scan sequence. Each non-zero 




F(g. 9 - Scanning patterns for block-based wavelet coding. 
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Fig. 10 - Simplified block diagram of 
interband block coder. 

element is represented by its run-length, which is the 
consecutive number of zero-valued elements preceding 
it in the scan sequence. Amplitudes of non-zero 
elements are entropy -coded using VLC tables. The 
size (number of bits) of the variable length codeword 
representing element amplitude is also recorded. Thus, 
a one-dinnensional scan sequence of elements is 
represented by a sequence of symbols of the type 
{{run-length, size), amplitude). Separate VLC tables 
are used to code {run-length, size) and amplitude 
symbols. Two-dimensional VLC tables are employed 
for the first class of symbols. Truncated codes are used 
for run-lengths so that when a run-length exceeds a 
pre-specified threshold (typically 15) a suitable code 
extension is required. Part of the code table is a 
reserved End-Of-Block (EOB) codeword used to 
indicate the absence of further zero elements within 
the current block. Symbols belonging in the second 
class are coded by means of one-dimensional VLC 
tables. A simplified block diagram of the interband 
block coder is shown in Fig. 10. 



3. LAPPED ORTHOGONAL 
TRANSFORM CODING 

A coding scheme based on the Lapped Orthogonal 
Transform has also been investigated.''^ Tt consists of 
three main functional blocks; the transform block, the 
uniform quantiser and the entropy coder. The last two 
blocks are functionally similar to those employed by 
the JPEG coder. The only difference is in the selection 
of quantiser step sizes. These have been heuristically 
optimised by considering perceptual distortion criteria. 
The VLC tables used have been identical to those 
employed by JPEG.''^ 

3.1 Lapped Orthogonal Transforms 

The Lapped Orthogonal Transform (LOT)''^ is a 
block-based invertible signal transform whose primary 
aim is the reduction of block artefacts in signals 
reconstructed at very low bit rates. These artefacts are 



usually the consequence of coarse quantisation of the 
transform coefficients. 



Whereas, in conventional block-based signal trans- 
forms, input and transformed blocks have identical 
sizes, this is not the case with the LOT. Instead, input 
blocks are larger than transformed blocks. Also, in 
contrast to conventional schemes, input blocks are 
overlapping. Using matrix notation, let: 



jc = [jt(0)j:(l)... jc(n-l)] 



be a collection of n input samples and: 



y = WO)y(l)... y(«-l)], 



a corresponding set of transform coefficients of the 
same size, so that: 



y = T' X 



(16) 



For a conventional Ny.N block transform, T' is the 
transpose of the transform matrix which can be written 
in block-diagonal form as: 



NxN input block 



NxN transform block 
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Fig. }} - Illustrating the principle of block partition in 

lapped transforms. 
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T = 



D 



D 











(17) 



where DisuNxN matrix whose columns are the basis 
functions that define the transform of each block. In 
the case of the LOT, T is given by; 



T = 



D, 



D 



D 



D. 



(18) 



where D, in this case, is an LxN matrix whose 
columns are the basis functions of the LOT and N <L 
< N. D] and Dj are matrices used to transform blocks 
located at the boundaries of the input data. The 
principle of two-dimensional block partitioning in 
lapped transform schemes is illustrated in Fig. 11. 
Neighbouring blocks of input samples overlap by L-N 
samples in each direction. Depending upon the choice 
of D, the LOT can be separable and implementable by 
fast algorithms, features desirable in practical system 
design. A particularly useful LOT for image coding 
purposes, is the one proposed in Reference 13. 
According to this algorithm, D is a superposition of 
odd and even DCT basis functions as follows; 
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Fig. 12(a) - Basis functions for the DCT, 
(b) - Basis junctions for the LOT. 
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D = 



where D^ and D^, are A'^ x N/2 matrices containing the 
even and odd DCT functions respectively, and J is the 
N/2 X N/2 counter-identity matrix. The first three basis 
functions of the DCT and the LOT are shown for 
comparison purposes in Fig. 12, In Fig. 13 the 
frequency response of the lowest-frequency basis 
functions of the DCT, LOT and wavelet transform is 
shown. It can be seen that the wavelet transform results 
in a much 'cleaner' partition of the spectrum. A similar 
conclusion can be drawn by examining higher- 
frequency basis functions. 



4. BIT-RATE CONTROL 

Accurate bit-rate control is important in digital storage 




DCT 

LOT 

wavelet 

Fig. 13 - Frequency response of DCT, LOT and 
wavelet basis functions. 
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applications, such as video recording. Established 
digital recording formats, such as the D-1, specify an 
exact number of bits which can be used to represent a 
video frame. Given that the systems studied in this 
report make use of variable-length coding, accurate 
bit-rate control is an important aspect of system 
design. 

It has already been mentioned that quantisation of 
sub-band elements/transform coefficients is the main 
rate/distortion control mechanism of the coding 
schemes studied. Therefore, bit-rate control can be 
based upon suitable adjustment of the quantiser step 
size used. By taking into account the occupancy level 
of the transmission buffer (i.e. the buffer which stores 
the code to be transmitted) the quantiser step size used 
for the current coding block (i.e transform block, 
macroblock, frame etc.) can be appropriately modified. 
For instance, an under-occupied buffer will allow finer 
quantisation to be applied to the next coding block and 
vice-versa. Although this mechanism has a stabilising 
effect upon the average bit-rate, it cannot guarantee an 
exact bit-rate for each individual frame. Moreover, 
implementing such a scheme for digital recording 
would involve a prohibitive amount of additional 
complexity. The realisation of a variable-length buffer 
would require extra circuitry to distribute information 
from one frame and record it in positions on the tape 
where other frames would be recorded. This would 
have an undesirable effect on the edit operation since 
tape length would not correspond to fixed picture 
segment lengths. 



In Reference 14, bit-rate control is achieved by picture 
pre-analysis. This is a preliminary procedure which 
measures the detail distribution within the entire input 
picture and provides an estimate of the data rate. 
During the coding stage, mismatches between 
estimated and actual rate are detected and the 
quantisation rule is accordingly modified to achieve a 
near-constant rate. 

target 
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Fig. 14 - Bit-rale control hy parallel pre-coding. 



Alternatively, parallel pre-coding using different 
quantiser step sizes is more appropriate for recording 
applications. Such a pre-coding scheme is implemented 
as a bank of quantisers entropy coders, each of which 
takes, as an input, an identical set of sub-band elements/ 
transform coefficients, and outputs a bit-stream 
corresponding to a compressed frame. A simplified block 
diagram of this configuration is shown in Fig. 14. The 
quantiser entropy coder combination (out of a 
collection of N such combinations) whose output bit- 
stream best 'matches' the target rate is selected. 
Side-information is necessary to signal to the decoder 
which quantisation step-size has been used. A variant 
of this approach has been proposed for the LOT 
bit-rate reduction scheme in Reference 11. It is 
claimed that by using the proposed approach, the rate 
can be controlled within 0.1 % accuracy. 

In the case of the wavelet-based bit-rate reduction 
scheme, pre-coding is modified to take into account 
the sub-band structure of the encoded information. 
With reference to Fig. 15, lower frequency sub-bands 
(1, 2, ... , j) are quantised using step sizes that do not 
vary from one frame to the next. The bit-rate allocated 
to them is subtracted from the target rate for the 
current frame and a residual target rate is obtained. 
Parallel pre-coding is then applied to higher frequency 
sub-bands only, according to the scheme discussed 
above. 

The rate allocated to higher frequency sub-bands (>j) 
should not exceed the residual rate computed earlier. 
This is summarised in Fig. 15 where the shaded 
pre-coder block corresponds to the system outlined in 
Fig. 14. It can be seen that the higher frequency 
sub-bands are encoded sequentially. After each 
sub-band is encoded, the rate allocated to it is 
subtracted from the current residual target rate and a 
new residual target rate is computed. This is fed to the 
pre-coder of the next higher frequency sub-band. The 
particular order in which higher frequency sub-bands 
are processed is not important. Using a pre-coder bank 
consisting of 8 parallel branches and using exclusively 
integer weights Q{m) (see Eq. 9), the rate can be 
controlled within 2.5-3% accuracy. Using more 
parallel branches and/or non-integer weights, the 
accuracy of the bit-rate control scheme can be further 
improved. For typical sequences, sub-bands 2-4 and 
2-7 were classified as 'lower frequency' for the 7- and 
the 10-sub-band schemes respectively. The remaining 
sub-bands were classified as 'higher frequency'. This 
classification is relevant to the bit-rate control scheme 
described above and should not be confused with that 
used elsewhere in this report. For non-natural scenes 
(i.e. graphics) a different classification may be 
necessary. Sub-band activity measures can be used to 
provide coding efficiency estimates and therefore 
provide the basis for automated sub-band classification. 
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Finally, it is worth noting that the arithmetic coder 
employed in the wavelet scheme, can be easily 
modified to improve the efficiency of the pre-coder. In 
fact, a modified arithmetic coder can provide the exact 
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Fig. 15 - Bii-rate control for the wavelet-based 
coding scheme. 



size of the code vi/ithout actually computing the code. 
This accelerates the pre-coding procedure considerably, 
since it involves the manipulation of 16-bit registers 
only, as opposed to the FIFO register which is destined 
to store the entire code. In this case, an arithmetic 
post-coder (a coder that will compute the actual code) 
is a necessary complement to the bank of pre-coders. 



5. EXPERfMENTS AND DISCUSSION 

The bit-rate reduction algorithms presented above 
were simulated without employing bit-rate control and 
applied to several HDTV sequences. Experimental 
results reported here are based on one of the most 
critical sequences available, code-named 'crowd'. This 
sequence was shot during the 1992 Wimbledon tennis 
championships using a CCD camera and contains a lot 
of fine detail and bright, saturated colours. The 
luminance component of the sequence was sampled at 
54 MHz to yield frames of 1440 x 1152 samples, 
each at a rate of 25 frames/sec. A 4:2:2 sampling 
structure was used for the chrominance components. In 
this series of experiments the effect of interlace was 
ignored. The first frame of the 'crowd' sequence is 
shown in Fig. I (see Section 9, page 16, for Figs. I - IV). 

Algorithm performance after cascaded encoding- 
decoding operations is very important for studio 
applications, where the picture material is likely to be 
coded more than once before being ready for 
transmission. It is also possible that a picture might be 
additionally processed by special post-production 
tools. This is taken into account in the simulations by 
shifting the picture by two pixels in both directions, 
vertically and horizontally, between each decoding and 
encoding operation. A worst case situation occurs by 
employing a two-pixel shift. The multi-generation 
performance assessment arrangement is shown in Fig. 1 6. 

The multi-generation performance of the bit-rate 
reduction algorithm has been studied at compression 





1 st generation 
assessment 






2nd generation 
assessment 




Fig. J6 - Multigeneration performance assessment with post-processing. 
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- - - - waveiel 8: 1 
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Fig. 17 - Multigeneration performance comparison of 

JPEG and wavelet-based coders without post-processing 

inserted between generations of recordings. 

ratios of 4:1 and 8:1. The first is more relevant to 
professional applications, such as studio recording, 
while the second is adequate for consumer applications. 
PSNR* values are plotted against the number of 
coding generations for each of the luminance (Y) and 



waveiet4:1 

JPEG 4:1 

waveiet 8: 1 

JPEG 8:1 

Fig. 18 - Multigeneration performance comparison of 

JPEG and wavelet-based coders with post-processing 

inserted between generations of recordings. 

the tvi/o chrominance (U,V) components. In all cases, 
results are compared to those obtained by the JPEG 
international standard algorithm for still picture 
encoding''^ for the same compression ratio and for a 
total of 10 generations. For comparison purposes, the 



Tlie peak signal-to-noise ratio (PSNR) lias been used forttie assessment of algorithm bit-rate reduction performance. This is a universally 
accepted distortion measure in digital video bit-rate reduction. Although there may be cases where it does not accurately reflect distortion, 
as perceived by human viewers, it is nevertheless useful towards establishing a quantitative measure of quality. 
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Fig. 19 - Multigeneration performance comparison of 

cascaded bit-rate reduction schemes with no 

post-processing inserted between generations of recordings. 



DC coefficients obtained from the DCT are not 
differentially coded, as outlined in JPEG, Instead, they 
are represented in fixed-length PCM format as they 
would be for a digital HDTV recorder using the JPEG 
scheme. The same representation is used for the 
low-frequency coefficients of the wavelet- based 
scheme. This measure allows both schemes to 



maintain a fraction of the total code as a fixed-length 
string, useful for confidence viewing and error control 
as already mentioned. 

In Fig. 17, multi-generation performance without 
introducing any post-processing between decoding and 
encoding, is recorded, After the first generation, at a 
compression ratio of 4:1, the wavelet-based coder 
performs approximately 4 dB better than JPEG in the 
case of luminance and 3 dB better in the case of 
chrominance. At a ratio of 8:1 the difference drops to 
approximately 1 ,5 dB in both cases. No significant loss 
of performance as a result of multiple copying was 
measured except in the case of 4:1 compression. In this 
case, the wavelet-based scheme lost 1.5 dB and 
0.75 dB respectively for luminance and chrominance, 
after ten generations of coding. 

In Fig. 18, multi-generation performance by intro- 
ducing post-processing between generations of coding 
is recorded. As mentioned previously, this is 
accomplished by shifting the picture by two pixels in 
both directions, vertically and horizontally. For the 
luminance component, the wavelet-based scheme is 
significantly more resistant to multi-generation 
deterioration, as is evident from the slope of the PSNR 
curves. Most notably, from the fourth generation 
onwards, the wavelet-based scheme at a compression 
ratio of 8; 1 outperforms JPEG at 4: 1 . For chrominance 
components, differences between both schemes are not 
so pronounced, although JPEG is consistently worse. 
By the tenth generation of coding, the wavelet-based 
schemes are still performing better than JPEG by 
approximately 1 dB. On the other hand, the observed 
PSNR drop between first and second generations is 
greater for the wavelet coder. 

As expected, given the relatively low degree of 
compression, decoded pictures are indistinguishable 
from the originals, at nominal viewing distances, 
except in the most extreme cases. In our experiments, 
the largest number of generations was 10 and 8:1 
compression gave the most visible impairments. A 
detailed view of the original first frame of 'crowd' is 
shown in Fig. II to allow for comparisons between 
systems. Detailed views of the two reconstructions 
after ten generations with post-processing applied at 
8:1 compression ratio, are shown in in Fig. Ill(a) for 
JPEG and Fig. ni(b) for wavelets. Difference pictures 
are produced by subtracting the reconstmcted pictures 
from the originals. To highlight coding losses, 
differences are further amplified by a factor of two. 
Differences of luminance signals are shown in 
Fig. IV(a) for JPEG and Fig. IV(b) for wavelets. A 
significant amount of high-frequency detail appears to 
be absent from the JPEG reconstruction while the 
wavelet-derived difference appears as being much 
more random and evenly spread over the entire image 
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plane. The pictures shown confirm the superiority of 
the wavelet-based scheme in terms of the portrayal of 
fine detail. 

A different set of experiments have been performed in 
order to assess the cascadability of combinations of the 
bit-rate reduction schemes developed. This type of 
assessment is important because it simulates situations 
in which different coding techniques are applied to 
different sections of the processing chain. For a total of 
four generations of cascaded encoding/decoding the 
results are summarised in Fig. 19 (previous page). For 
comparison purposes, multi-generation performance 
using a single type of coding technique is also 
indicated. As before, PSNR is plotted against the 
number of coding generations for each of the three 
luminance and chrominance components for a 
compression ratio of 4: 1 . Our results show that there is 
a sharp increase in measured mean-square error 
between the first and the second generations of coding, 
when different coding schemes are employed in 
alternate order. Between subsequent generations the 
increase is not as emphatic but tends to level out 
progressively instead. The allocation of bit-rate among 
luminance and chrominance components has not been 
kept fixed for the three different systems tested, 
namely DCT, LOT and the wavelet-based scheme. 
Instead, the allocation yielding equal visual quality 
has been used for each scheme as shown in Table 5. 

Table 5 

Bit-rate allocation among luminance and chrominance 

components. 





Y(bpp) 


U(bpp) 


V (bpp) 


DCT 

LOT 

Wavelet 


3.2 
3.2 
2.7 


0.4 

0.4 
0.65 


0.4 
0.4 
0.65 



international standard algorithm for still picture 
encoding, were noted. This was particularly true after 
multiple generations of coding with post-processing 
inserted between encoding and decoding. Moreover, 
for recording applications, the wavelet decomposition 
was shown to provide a multi-layer representation 
which is naturally compatible with lower definition 
video formats. Additionally, it facilitates desirable 
features, such as confidence viewing, and has the 
potential for prioritizing the error-control coding for 
recording purposes. The LOT scheme is a block-based 
technique offering substantial immunity against 
blocking artefacts. These are introduced at low 
bit-rates by block-based coders. The LOT-based 
technique also offers better multi-generation subjective 
performance than wavelets when no post-processing is 
introduced between generations of recordings. On the 
other hand, LOTs are computationally more expensive 
to implement than the DCT and wavelets. 

A significant, though not prohibitive, amount of 
distortion is introduced when different bit-rate 
reduction schemes are cascaded. This indicates that 
equipment incorporating different coding techniques 
can form part of a video processing chain in the studio, 
retaining HDTV quality for a limited number of 
generations. 

A review of difference signals between the original 
pictures and the decoded reconstructions, and of 
reconstructed picture close-ups, has shown that the 
wavelet-based and the LOT schemes are capable of 
superior portrayal of features which are normally 
difficult to code, such as higher frequency details, and 
are free from any blocking and noise artefacts usually 
associated with conventional block-based transform 
coders. 
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6. CONCLUSIONS 

In this report, novel intraframe source coding 
algorithms for digital HDTV recording have been 
discussed. Two main classes of transform-based 
techniques have been considered; Wavelet Transforms 
and Lapped Orthogonal Transforms. The wavelet- 
based technique has been based on a sub-band 
decomposition of the source picture obtained by spatial 
filtering with a suitable set of wavelet kernels. A 
predictive arithmetic coding technique was sub- 
sequently used to encode without loss individually 
quantised and segmented frequency sub-bands. For 
bit-rate reduction ratios in the range 4:1 - 8:1, useful 
for professional and consumer recording applications, 
significant mean-square error performance improve- 
ments over JPEG, the current state-of-the-art 



Part of the work described in this report was carried 
out in collaboration with NHK and their contribution is 
acknowledged with thanks. It has been a pleasure to 
work with T. Kato of NHK during his six-month stay 
at BBC R&D. 



8. REFERENCES 

1 . BURHELD, P.R., 1989. Digital HDTV recorder 
using a cluster of four D1 DVTRs, BBC 
Research Department Report No. 1 989/1 3. 

2. MALLAT, S.G., 1989. A Theory for Multi- 
resolution Signal Decomposition: The Wavelet 
Representation, IEEE Trans. PAMI, 11, July, 
pp. 674-693. 



(R016) 



14- 



3. MALLAT, S.G., 1989. Multifrequency Channel 
Decompositions of Images and Wavelet Models, 
IEEE Trans. ASSP, 37, December, pp. 2091-21 10. 

4. VETTERLI M. and HERLEY, C„ 1992. 
Wavelets and Filter Banks: Theory and Design, 
IEEE Trans. Sig. Proc, 40, September, 
pp. 2207-2232. 

5. ANTONINI, M., BARLAUD, M., MATHIEU, P. 
and DAUBECHIES, L, 1992. Image Coding 
Using Wavelet Transform, IEEE Trans. Image 
Proc, 1, April, pp. 205-220. 

6. LANGDON, G.G., Jr., 1984. An Introduction to 
Arithmetic Coding, IBM J. Res. Dev., 28, 
March, pp. 135-149. 

7. LANGDON, G.G., Jr., and RISSANEN, J., 
1981, Compression of Black and White Images 
with Arithmetic Coding, IEEE Trans. Comm., 
29, June, pp. 858-867. 

8. RISSANEN, J. and MOHIUDDIN, K.M., 1989. 
A Multiplication-Free Muitialphabet Arithmetic 
Code. IEEE Trans. Comm., 37, February, 
pp. 93-98. 



9. MAX, J., 1960. Quantizing for Minimum 
Distortion, IRE Trans. Info. Theory, 6, pp. 7-12, 

10. MORAN, D., VLACHOS, T,, MORRIS O. J., 
and CONSTANTINIDES, A.G., 1990. Region 
image coding techniques. Digest of the lEE Coll. 
on low bit-rate image coding. May, London. 

11. KATO,T., KUGE,T., MAJIMA, K., 
KURIOKA, T., OKUDA H., and OSHIMA, H., 
1994. A study on Bit- rate reduction for a 
Broadcast-use HDTV-VTR, Sig. Proc. of 
HDTV:V, pp. 565-572, Elsevier, Amsterdam. 

12. WALLACE, G.K., 1992. The JPEG Still Picture 
Compression Standard, IEEE Trans. Cons. 
Elec, 38, February, pp. 18-34. 

13. MALVAR, H.S., and STAELIN, D.H., 1989. 
The LOT: Transform Coding without Blocking 
Effects, IEEE Trans. ASSP, 37, April, 
pp. 553-559. 

1 4. KAUFF, P., STAMMNITZ P., and SCHAFFER, 
R., 1992. A CODEC for digital studio recording 
of HDTV, IBC, No. 358, pp. 112-116, July, 
Amsterdam. 



(R016) 



15- 



9. FIGURES I - IV 




Fig. I - First frame of HDTV sequence 'crowd'. 




Fig. 11 - Detailed view of 'crowd'. 



(R016) 



■16- 




Fig. lU(a} - Detailed view of J PEG -decoded 'crowd 




Fig. lll(b) - Detailed view of wavelet-decoded 'crowd'. 
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fig. IV(a) - Detail of amplified difference between original and JPEG-decoded 'crowd'. 






Fig. IV(b) - Detail of amplified difference between original and wavelet-decoded 'crowd'. 
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APPENDIX 

Bit-Stream syntax for the wavelet-based bit-rate reduction scheme 

An example of a bit-stream syntax for the wavelet-based bit-rate reduction scheme is presented. The example is 
based on a single image component (i.e. luminance) but generalises easily to accommodate more components (i.e. 
chrominance) as well. Sections of the bit-stream roughly correspond to the constiment parts of the multi-layer 
hierarchy achieved with the wavelet-based scheme. Given that the proposed coding technique is intraframe (for 
reasons already stated), the main recoverable building block of a coded sequence, is the frame. According to the 
hierarchy developed, each frame consists of sub-bands, each sub-band of segments and so on, 

The wavelet filter parameters are considered a priori known to both encoding and decoding ends and, 
consequently, are not shown as part of the syntax. 

For a particular frame, two separate sections of the bit-stream are considered; a section containing sub-band 1 (the 
lowest frequency sub-band) of that frame and a section containing any of the other (higher frequency) sub-bands. 

The syntax corresponding to the first section is shown Fig. Al.l. As already explained, sub-band 1 is not 
compressed further after quantisation but is PCM encoded instead. As a consequence, it occupies a fixed-length 
portion of the total bit-stream. Each sub-band I element is, in general, represented with a precision greater than 8 
bits. The exact precision depends upon the number of sub-bands and the gain of the wavelet filters employed. 
Assuming that 8 bit/pixel precision facilitates immediate data recovery (i.e. without activating any of the decoding 
circuits) for confidence viewing purposes, the Most Significant Bytes (MSBs) can be kept separate at the very 
beginning of a frame. The residual Least Significant bits (LSbs) can be arranged as shown in Fig, Al.l . 

The syntax corresponding to the second section is shown Fig. Al .2, This section contains the information required 
by the decoder in order to reconstruct each of the higher frequency sub-bands. This information consists of the 
number of segments, the offset applied to the sub-band prior to segmentation of the quantiser step-size, the 
amplitudes of uncoded boundary pixels (to combat ringing) and the segment code. The latter contains, in turn, the 
code length, mean boundary amplitudes (to facilitate prediction when the prediction template addresses undefined 
pixel positions) and the arithmetic code. Finally, optional headers to the latter could be used to convey information 
with regard to the shape of the prediction template and the statistical initialisation employed. 



frame 



frame 2 



Ibyte 



# sub-bands 



sub-band 1 



sub-band 2 



ffi 



sub-bond n 



ffl 



frame 1 



frame 2 



]bYte 



S sub-bonds 



sub-bond 1 



sub-band 2 



t 



sub-band 



3] 



] byte 1 byte 



MSB 1 



MSB_2 



n 

m 

I MSB_N 



LSb^N 



MSB_N 



LSb_N 



Tbyte 



MSB_N jlSb, 



LSb 2 



LSb_N 



fixed-lengtfi field 

variable-lengtti field 

most significant byte of the N-tfi element 
of sub-bond 1 

least significonf bits of tfie N-tti element 
of sub^band 1 

amplitude of N-thi element of sut>band 1 



/ byte 1 byte 1 byte 



segments 



offset 



quantiser 
weight 



boundorv 



segment^ 



|j.. ^egment| ^ 



kbytes 



I byte 



Ibyte 



coded segment lengtti 



boundary 
mean 1 



boundary 
mean 2 



arittimeti' 
code 



I 



I byte 



prediction template 
(optionol) 



statistics initiolisation 
(optionoi) 



coded 
segment 



I 



Fig. Al.l - Tentative bit-stream syntax for PCM encoded 
sub-band I. 



Fig. AJ.2 - Tentative bit-stream syniu.K for arithmetically 
encoded sub -bands. 
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